3574 results found.
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
11 GByte Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
-
Paper title:WAC: A Corpus of Wikipedia Conversations for Online Abuse Detection
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Noé Cécillon | Wikipedia Comment Corpus | /N |
Documentation:
None
Written
Web Service,
Language Type:
Multilingual
Languages:
English Finnish French German Spanish
Availability:
From Owner
License:
Size:
None Production Status:
Existing-updated
Use:
Lexicon Creation/Annotation
-
Paper title:Multilingualization of Medical Terminology: Semantic and Structural Embedding Approaches
-
Paper track:Terminology/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Long-Huei Chen | Unified Medical Language System (UMLS) Metathesaurus | /N |
Documentation:
None
Written
Web Service,
Language Type:
Bilingual
Languages:
English Spanish
Availability:
Freely Available
License:
Size:
None Production Status:
Existing-updated
Use:
Information Extraction, Information Retrieval
-
Paper title:Multilingualization of Medical Terminology: Semantic and Structural Embedding Approaches
-
Paper track:Terminology/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Long-Huei Chen | Medical Subject Headings (MeSH) | /N |
Documentation:
None
Written
Lexicon,
Language Type:
Bilingual
Languages:
English Scottish Gaelic
Availability:
Freely Available
License:
CreativeCommons BY-SA 3.0
Size:
13000 synsets Production Status:
Newly created-in progress
Use:
Word Sense Disambiguation
-
Paper title:A Major Wordnet for a Minority Language: Scottish Gaelic
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Gábor Bella | Unified Scottish Gaelic Wordnet | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
Dutch English German Swedish
Availability:
Part freely available, part through search interface
License:
mixed CC and "for research purposes after registration"
Size:
None tokens Production Status:
Newly created-in progress
Use:
historical linguistic research
-
Paper title:The EDGeS Diachronic Bible Corpus
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Gerlof Bouma | EDGeS Diachronic Bible Corpus | /N |
Documentation:
https://spraakbanken.gu.se/en/projects/complex-verb-constructions
Multimodal/Multimedia
Lexicon,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
None Production Status:
Newly created-in progress
Use:
Education, Edutainment
-
Paper title:VROAV: Using Iconicity to Visually Represent Abstract Verbs
-
Paper track:Multimodality/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Carlo Strapparava | Vroav | /N |
Documentation:
None
Written
Labelled Dataset of (Near) Duplicate Scholarly Documents Abstracts,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
204 MByte Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
-
Paper title:Deduplication of Scholarly Documents using Locality Sensitive Hashing and Word Embeddings
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Bikash Gyawali | Scholarly Documents Deduplication Dataset | /N |
Documentation:
None
Written
Corpus Tool,
Language Type:
Multilingual
Languages:
English German Italian
Availability:
Freely Available
License:
BSD
Size:
None OtherProduction Status:
Existing-updated
Use:
Linguistic complexity measurement
-
Paper title:CTAP for Italian: Integrating Components for the Analysis of Italian into a Multilingual Linguistic Complexity Analysis Tool
-
Paper track:Written/poster presentation with demo
-
Paper status:Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Nadezda Okinina | CTAP | /N |
Documentation:
https://github.com/commul/ctap/tree/master/doc/
Treebank,
Language Type:
Monolingual
Languages:
English
Availability:
License:
LDC
Size:
None Production Status:
Existing-updated
Use:
Corpus Creation/Annotation
-
Paper title:Do you Feel Certain about your Annotation? A Web-based Semantic Frame Annotation Tool Considering Annotators’ Concerns and Behaviors
-
Paper track:Evaluation/poster presentation with demo
-
Paper status:Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Regina Stodden | Penn WSJ Treebank v.3 | /N |
Documentation:
None
Lexicon,
Language Type:
Monolingual
Languages:
English
Availability:
License:
Size:
None Production Status:
Existing-used
Use:
Corpus Creation/Annotation
-
Paper title:Do you Feel Certain about your Annotation? A Web-based Semantic Frame Annotation Tool Considering Annotators’ Concerns and Behaviors
-
Paper track:Evaluation/poster presentation with demo
-
Paper status:Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Regina Stodden | FrameNet 1.7 | /N |
Documentation:
None




